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© Method and apparatus for Identification of document skew. 



© A method and apparatus for identifying and cor- 
recting for document skew. Lines of a bitmap are 
scanned and a variance in the number of ON (or 
OFF) pixels as a function of angle of rotation is 
calculated (4, 6, 8). Skew of the original document 
occurs when the variance is a maximum (1 0). 



Europafsches Patentamt 
© /))|) European Patent Office 

Office europeen des brevets 



■ 

(») Publication number 



CM 
< 

CM 
CO 
0> 



CO 



Q. 

ui 



Xerox Copy Centre 



1 



EP 0 431 962 A2 



2 



METHOD AND APPARATUS FOR IDENTIFICATION OF DOCUMENT SKEW 



The present invention relates to the field of 
image processing. More specifically, in one em- 
bodiment the invention provides a method and 
apparatus for determining the skew of a document 
containing text and/or graphics and, optionally, ad- 
justing the skew of the document to a desired 
angle. 

In many image processing applications it is 
desirable to determine skew of a document image. 
For example, for a text image skew is the angle at 
which lines of the text image differ from a pre- 
selected reference line, such as a horizontal line. 
Optical character recognition systems are exem- 
plary of an image processing system in which it 
would be desirable to know the angle of skew of 
text since such systems typically scan horizontally 
through a portion of a document when processing 
an image of the document If a text image is not 
properly oriented, OCR systems may have dif- 
ficulty in "recognizing" individual characters and, in 
severe cases, may actually scan through multiple 
lines of text 

A variety of methods and devices for determin- 
ing document skew have been proposed. While 
meeting with some success, prior methods have 
met with a variety of limitations. For example, some 
methods/devices require non-conventional or spe- 
cial purpose hardware, increasing the complexity 
and cost of such systems. Some systems require 
unacceptable amounts of computer processing 
time or computer processing capability. Still other 
methods operate only on text or require that spe- 
cialized data representations be utilized. 

It is an object of the invention to enable an 
improved method and apparatus for determining 
and adjusting document skew to be provided. 

The present invention provides a method and 
apparatus for identifying the skew of a document 
from a bitmap image. Image reduction is optionally 
used to enhance performance. The image is either 
rotated or vertically sheared to simulate rotation 
and a statistical measure, such as the variance of 
the number of pixels on a scanline, is maximized to 
determine skew angle. 

More specifically, the present invention pro- 
vides a method of determining skew of an image in 
a digital processing means comprising the steps of 
inputting an image of a document; and determining 
a variance in a number of ON pixels in at least one 
scan tine of said image as a function of document 
rotation angle, said skew located at a document 
rotation angle at which said variance is substan- 
tially a maximum. The present invention also pro- 
vides a method of determining skew of an image in 
a digital processing means comprising the steps of 



inputting an image of a document; and determining 
a variance in a number of OFF pixels in at feast 
one scanline of said image as a function of docu- 
ment rotation angle, said skew located at a docu- 

5 ment rotation angle at which said variance is sub- 
stantially a maximum. 

A method in accordance with the invention may 
be preceded by the step of reducing said image. 
The reduction may be by at least a scale of 4. The 

70 reduction may be a threshotded reduction. 

Variance may be calculated through a range of 
about + 5 to -5 degrees of rotation. Variance may 
be calculated at increments of about 0.2 degrees of 
rotation. 

75 A method in accordance with the invention may 
further comprise the step of rotating said image by 
an angle of about said skew so as to produce a 
deskewed document image. The step of rotating 
may use at least about two orthoganal shears. The 

20 method may further comprise the step of identify- 
ing characters in said deskewed document image 
using an optical character recognition system, or it 
may further comprise the step of reproducing said 
deskewed document image. 

25 The present invention further provides a digital 
image processing device programmed to deter- 
mine skew of an image of a document, said digital 
processing device programmed to input an image 
of said document; and determine a variance in a 

30 number of ON pixels in at least one scanline of 
said image as a function of document rotation 
angle, said skew located at a document rotation 
angle at which said variance is substantially a 
maximum. The device may be programmed to 

35 calculate said variance by an equation of the form: 

N 

£(n,)p 
i = 1 

40 

where: 

N is a number of lines; 
p is a real number greater than 1 ; 
n is the number of ON pixels in a line; and 
45 i is a counter for the lines. 

As an example, p may be programmed to be 
about 2. 

The present invention also provides a digital 
image processing device programmed to deter- 
50 mine skew of an image of a document said digital 
processing device programmed to input an image 
of said document and determine a variance in a 
number of OFF pixels in at least one scanline of 
said image as a function of document rotation 
angle, said skew located at a document rotation 
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angle at which said variance is substantially a 
maximum. The device may be programmed to 
calculate said variance by an equation of the form: 

N 

E(rn)p 
1 = 1 

where: 

N is a number of lines; 

p is a real number greater than 1 ; 

n is the number of OFF pixels in a line; and 

i is a counter for the lines. 

As an example, p may be programmed to be 
about 2. 

A device in accordance with the invention may 
be further programmed to reduce said image be- 
fore determining skew of said image. The reduction 
may be by at least a scale of 4. The reduction may 
be a thresholded reduction. 

A device in accordance with the invention may 
be programmed to input substantially all of the 
scanlines in a document to calculate said variance. 
A device in accordance with the invention may be 
programmed to simulate said rotation angle by a 
vertical shear of said image; and/or to calculate 
said variance over a range of about ± 5 degrees of 
rotation; and/or to calculate said variance at incre- 
ments of about 02. degrees of rotation. 

A device in accordance with the invention may 
be programmed to rotate said image by an angle 
of about said skew so as to produce a deskewed 
document image, for example by using at least 
about two orthoganal shears. The device may fur- 
ther comprise an optical character recognition sys- 
tem for identifying characters in said image, or it 
may further comprise reproduction means for re- 
producing said deskewed document image. 

One embodiment of the invention comprises an 
optical character recognition system which includes 
input means for inputting an image of a document, 
deskewing means comprising: (i) means for rotat- 
ing the image frc^n at least a first skew angle to a 
second skew angle; (ii) means for calculating vari- 
ance of a number of ON pixels in a scanline of the 
image at each of the skew angles; and (iii) means 
for rotating the image to about an angle which 
substantially maximizes the variance to produce a 
deskewed image. The optical character recognition 
system further includes means for identifying char- 
acters in the deskewed image; and means for 
outputting the characters in the image. 

Another embodiment of the invention com- 
prises apparatus for copying a document, the ap- 
paratus including input means for inputting an im- 
age of a document, and deskewing means which 
includes: (i) means for rotating the image from at 
least a first skew angle to a second skew angle; (ii) 



means for calculating variance of a number of ON 
pixels in a scanline of the image at each of the 
skew angles; and (i») means for rotating the image 
to about an angle which substantially maximizes 
s the variance to produce a deskewed image of the 
document The apparatus further includes means 
for outputting the deskewed image of the docu- 
ment 

By way of example only, embodiments of the 
10 invention will now be described with reference to 
the attached drawings, in which: 

Fig. 1 A is an overall block diagram illustrating an 
image analysis system and Fig. 1B is a 
flowchart illustrating a method in accordance 
15 with the invention; 

Figs. 2A and 2B are a text image and a plot of 
variance as a function of skew angle, respec- 
tively; 

Rgs. 3A and 3B are a text image and plot of 
20 variance as a function of skew angle, respec- 
tively, for a reduced image; and 
Rg. 4 illustrates apparatus for carrying out a 
reduction operation. 

25 I. Definitions and Terminology 

The present discussion deals with binary im- 
ages. In this context the term "image" refers to a 
representation of a two-dimensional data structure 

30 composed of pixels. A binary image is an image 
where a given pixel is either "ON" or "OFF." 
Binary images are manipulated according to a 
number of operations wherein one or more source 
images are mapped onto a destination image. The 

35 results of such operations are generally referred to 
as images. The image that is the starting point for 
processing will sometimes be referred to as the 
original image. 

Pixels are defined to be ON if they are black 

40 and OFF if they are white. It should be noted that 
the designation of black as ON and white as OFF 
reflects the fact that most documents of interest 
have a black foreground and a white background. 
While the techniques to be described could be 

45 applied to negative images as well, the discussion 
will be in terms of black on white. In some cases 
the discussion makes reference to a "don't care" 
pixel which may be either an ON or an OFF pixel. 
A "solid region" of an image refers to a region 

so extending many pixels in both dimensions within 
which substantially ail the pixels are ON. 

A "textured region" of- an image refers to a 
region that contains a relatively fine-grained pat- 
tern. Examples of textured regions are halftone or 

55 stippled regions. 

AND, OR, and XOR are logical operations car- 
ried out between two images on a pixel-by-pixel 
basis. 
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NOT is a logical operation carried out on a 
single image on a pixel-by-pixel basis. 

"Expansion" is a scale operation characterized 
by a SCALE factor N, wherein each pixel in a 
source image becomes an NxN square of pixels, 
all having the same value as the original pixel. 

"Reduction" is a scale operation characterized 
by a SCALE factor N and a threshold LEVEL M. 
Reduction with SCALE = N entails dividing the 
source image into NxN squares of pixels, mapping 
each such square in the source image to a single 
pixel on the destination image. The value for the 
pixel in the destination image is determined by the 
threshold LEVEL M, which is a number between 1 
and N 2 . If the number of ON pixels in the pixel 
square is greater or equal to M, the destination 
pixel is ON, otherwise it is OFF. 

"Subsampling - is an operation wherein the 
source image is subdivided into smaller (typically 
square) elements, and each element in the source 
image is mapped to a smaller element in the 
destination image. The pixel values for each des- 
tination image element are defined by a selected 
subset of the pixels in the source image element. 
Typically, subsampling entails mapping to single 
pixels, with the destination pixel value being the 
same as a selected pixel from the source image 
element The selection may be predetermined 
(e.g., upper left pixel) or random. 

A "4-connected region" is a set of ON pixels 
wherein each pixel in the set is laterally or verti- 
cally adjacent to at least one other pixel in the set 

An "8-connected region" is a set of ON pixels 
wherein each pixel in the set is laterally, vertically, 
or diagonally adjacent to at least one other pixel in 
the set 

A number of morphological operations map a 
source image onto an equally sized destination 
image according to a rule defined by a pixel pat- 
tern called a structuring element (SE). The SE is 
defined by a center location and a number of pixel 
locations, each having a defined value (ON or 
OFF). The pixels defining the SE do not have to be 
adjacent each other. The center location need not 
be at the geometrical center of the pattern; indeed 
it need not even be inside the pattern. 

A "solid" SE refers to an SE having a periph- 
ery within which all pixels are ON. For example, a 
solid 2x2 SE is a 2x2 square of ON pixels. A solid 
SE need not be rectangular. 

A "hit-miss" SE refers to an SE that specifies 
at least one ON pixel and at least one OFF pixel. 

"Erosion" is a morphological operation wherein 
a given pixel in the destination image is turned ON 
if and only if the result of superimposing the SE 
center on the corresponding pixel location in the 
source image results in a match between all ON 
and OFF pixels in the SE and the underlying pixels 



in the source image. 

"Dilation" is a morphological operation wherein 
a given pixel in the source image being ON causes 
the SE to be written into the destination image with 
5 the SE center at the corresponding location in the 
destination image. The SE*s used for dilation typi- 
cally have no OFF pixels. 

"Opening" is a morphological operation that 
consists of an erosion followed by a dilation. The 
10 result is to replicate the SE in the destination 
image for each match in the source image. 

"Closing" is a morphological operation consist- 
ing of a dilation followed by an erosion. 

A "mask" refers to an image, normally derived 
75 from an original image, which contains substantially 
solid regions of ON pixels in a region of interest in 
the original image. 

The various operations defined above are 
sometimes referred to in noun, adjective, and verb 
20 forms. For example, references to dilation (noun 
form) may be in terms of dilating the image or the 
image being dilated (verb forms) or the image 
being subjected to a dilation operation (adjective 
form). No difference in meaning is intended. 

25 

II. Details of Operation of Embodiments of the 
Invention 



The present invention relates to a method and 

30 apparatus for determining skew directly from a 
bitmap. In preferred embodiments, all or substan- 
tially all pixels in the bitmap are utilized. The vari- 
ance of the number of ON pixels in a scanline is 
determined as a function of the angle of rotation of 

55 the bitmap. This variance will be substantially maxi- 
mized when the image has very close to zero 
skew. Since the method uses all the pixels in the 
image, it will work on images with only text, only 
line graphics (if there are some horizontal lines), 

40 and on some images with mixed text stipples 
and/or halftones. 

Fig. 1A is a block diagram of an image analysis 
system 100. The basic operation of system 100 is 
to extract or eliminate certain characteristic por- 

45 tons of a document 102. To this end, the system 
includes a scanner 103 which digitizes the docu- 
ment on a pixel basis, and provides a resultant 
data structure. Depending on the application, the 
scanner may provide a binary image (a single bit 

so per pixel) or a gray scale image (a plurality of bits 
per pixel). This data structure contains the raw 
content of the document, to the precision of the 
resolution of the scanner. This data structure, typi- 
cally referred to as an image, may be sent to a 

55 memory 104 or stored as a file in a file storage unit 
105, which may be a disk or other mass storage 
device. 

A processor 106 controls the data flow and 
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performs the image processing. Processor 106 
may be a general purpose computer, a special 
purpose computer optimized for image processing 
operations, or a combination of a general purpose 
computer and auxiliary special purpose hardware. If 
a file storage unit is used, the image is transferred 
to memory 104 prior to processing. Memory 104 
may also be used to store intermediate data struc- 
tures and possibly a final processed data structure. 

The result of the image processing can be a 
derived image, numerical data (such as coordinates 
of salient features of the image) or a combination. 
This information may be communicated to 
application-specific hardware 108, which may be a 
printer or display, or may be written back to file 
storage unit 105. 

Fig. 1 B is a flowchart illustrating a method for 
determining and correcting image skew. In step 2, 
the input binary image is optionally reduced so as 
to accelerate operation. In preferred embodiments, 
where the image is scanned at 300 pixels/inch, a 
reduction of about 4x to 8x is utilized and has 
provided satisfactory results while significantly in- 
creasing the operational speed. While a conven- 
tional "reduction" step is used for purposes of 
illustration herein, other forms of reduction will also 
perform satisfactorily such as simple sub-sampling 
or a thresholded reduction. 

In steps 4 to 6 the variance in the number of 
ON pixels is calculated for a particular angle of 
rotation of the image. As used herein, the 
"variance" in the number of ON pixels may mean: 




c, c 2 



where: 

N is the number of lines utilized; 

n is the number of ON pixels in a line; 

i is a counter for the lines; and 

Ci and C2 are constants which may generally be 

ignored. 

Although not critical to the method, substan- 
tially all of the lines of the image are utilized in 
calculating the variance. When similar documents 
will be encountered from scan to scan, it will be 
possible to utilize only a select sampling of scan 
lines or even a single scan line. In the above 
formula, ail terms except the sum of the square of 
the number of pixels in a line will be substantially 
constant and can be ignored. Therefore, in one 



embodiment, only the sum of the square of the 
number of ON pixels on each scanline as a func- 
tion of angle of rotation is calculated. This differs 
from the variance by a constant independent of 
5 skew. 

While the method for determining skew is illus- 
trated herein by way of the calculation of the 
square of the number of pixels in a scanline, it will 
be recognized that other powers could be utilized; 

10 any power greater than 1 of the number of ON 
pixels in a scanline could be effectively utilized. 
Taking the square is preferred because it is typi- 
cally faster than any other power. For brevity the 
above formula in any of its alternative forms is 

rs referred to as "variance" herein. 

In step 8, the variance is calculated at a wide 
variety of relatively closely spaced rotation angles, 
for example at increments of about 0.2 degrees of 
rotation. In some embodiments, the image is not 

20 actually rotated in order to calculate variance, but 
instead vertical shear is applied to the image. In a 
vertical shear operation, vertical blocks of the im- 
age are moved upward (or downward) some num- 
ber of pixels, and the number of pixels shifted 

25 increases by one for each block to the right. The 
horizontal width of each block varies inversely with 
the shear angle. For example, to simulate a rotation 
by one degree (approximately 1/57th of a radian), 
the first vertical block of width 28 pixels is left 

30 unchanged, while the next vertical block of width 
57 pixels is shifted up by one pixel, the next 
vertical block of width 57 pixels is shifted up by 
two pixels, etc. 

In preferred embodiments, the image is rotated 

35 both positively and negatively about -5 to + 5 
degrees (0 max ) since, in most applications, docu- 
ment skew will not exceed this amount. After cal- 
culation of the variance at a number of rotation 
angles, the angle at which the variance is substan- 

40 tially a maximum is located at step 10 by means 
well known to those of skill in the art The maxi- 
mum in variance need not be located exactly, but it 
is preferred that it should be located to within at 
least about ±20% and most preferably within about 

45 ±10%. In some embodiments, the maximum is 
located by evaluating the variance at a variety of 
increments between ±0 max and interpolating to find 
the maximum. Other more sophisticated methods 
may alternatively be used such as interval halving 

50 techniques. The angle at which the variance is a 
maximum will be very close to the angle of skew of 
the document 

The advantage of performing the operations at 
reduced scale is that the time to do each operation 

55 (such as vertical shearing or counting) varies with 
the square of the reduction factor. Thus, a reduc- 
tion by a factor of 4 reduces the computation time 
by a factor of 16. 
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Of course, there is a toss in accuracy when a 
reduced image is used. The relation between the 
accuracy and the resolution is as follows: the an- 
gular resolution (in radians) of the deskew algo- 
rithm is approximately 1 /(image width in pixels). 
Thus, if the original image is 2500 pixels wide and 
is reduced by a factor of 8, the angular resolution 
will be about 1/300, or about 0.2 degrees which is 
acceptable in most commercial applications. In 
practice, the error is typically found to be smaller 
than this. 

Optionally in step 12 the original image is 
rotated by the negative of the angle determined in 
step 10 so as to deskew the image. Skew correc- 
tion is achieved by rotating the image. Efficient 
means for rotating an image using bitbft (i.e.. raster 
o (Derations) exist are known and are discussed in, 
for example, Paeth, "A Fast Algorithm for Fast 
Raster Rotatin, a Proa Vision Interface '86, Vancou- 
ver B.C., May 1986, pg. 77-81, which" is incor- 
porated by reference herein for all purposes. For 
small angles of rotation (say, less than 5 degrees), 
rotation can be approximated by two orthogonal 
shears. For larger angles, rotation can be approxi- 
mated by three shears, with the first and third 
shear of equal magnitude in the horizontal direction 
and the second larger shear in the vertical direc- 
tion. 

One requirement of efficient operation, is that 
thresholded reduction must be done quickly. Sup- 
pose it is desired to reduce an image by a factor of 
two in the vertical direction. One way to do this is 
to use a raster operation (bitblt - bit block transfer) 
to logically combine the odd and even rows, creat- 
ing a single row of the reduced image for each pair 
of rows in the original. The same procedure can 
then be applied to the columns of the vertically 
squashed image, giving an image reduced by a 
factor of two in both directions. 

The result, however, depends on the logical 
operations of the horizontal and vertical raster oper- 
ations. Obtaining a result with LEVEL = 1 or 4 is 
straightforward. If an OR is used for both raster 
operation orientations, the result is an ON pixel if 
any of the four pixels within the corresponding 2x2 
square of the original were ON. This is simply a 
reduction with LEVEL = 1. Likewise, if an AND is 
used for both raster operation orientations, the re- 
sult is a reduction with LEVEL = 4, where all four 
pixels must be ON. 

A somewhat different approach is used to ob- 
tain a reduction with LEVEL = 2 or 3. Let the result 
of doing a horizontal OR followed by a vertical AND 
be a reduced image R1, and let the result from 
doing a horizontal AND followed by a vertical OR 
be a reduced image R2. A reduction with LEVEL = 
2 is obtained by ORing R1 with R2, and a reduc- 
tion with LEVEL = 3 is obtained by ANDing R1 



with R2. 

The procedure may not be computationally ef- 
ficient if implemented as described above. On 
some computers, such as Sun workstations, raster 

5 operations are done in software. The image is 
stored as a block of sequential data, starting with 
the first row of the image, moving left-to-right, then 
the second row, etc. Consequently, the raster oper- 
ations between rows are fast, because 16 or 32 bits 

10 in two words can be combined in one operation. 
But to perform a raster operation between two 
columns, the corresponding bits must be found, 
two bits at a time (one from each column), before 
the logical operations can be done. It turns out that 

is the time, per pixel, to do the vertical raster oper- 
ations is at least 25 times greater than the horizon- 
tal ones. In fact, when the method is implemented 
entirely with raster operations, over 90 percent of 
the time is devoted to the vertical operations. 

20 Fortunately, there is a simple and very fast way 
to implement the logical operations between col- 
umns. Rather than use column raster operations, 
take 16 sequential bits, corresponding to 16 col- 
umns in one row. These 16 bits can be accessed 

25 as a short integer. These 16 bits are used as an 
index into a 2 16 -entry array (i.e., a lookup table) of 
8-bit objects. The 8-bit contents of the array give 
the result of ORing the first bit of the index with the 
second, the third bit with the fourth... and on to the 

30 15th bit with the 16th. Actually, two arrays are 
needed, one for ORing the 8 sets of adjacent 
columns, and one for ANDing the columns. It 
should be understood that the numerical example 
is just that, an example. It is also possible to 

35 implement this as a 2 s -entry array of 4-bit objects, 
or any one of a number of other ways. 

The use of lookup tables to implement column 
logical operations is about as fast, per pixel, as 
Sun's row raster operations. A 1000x1000 pixel 

40 image can be reduced on a Sun 3/260, with either 
LEVEL = 1 or 4, to a 500x500 pixel image in 0.10 
second. On a Sun 4/260, the operation takes about 
0.06 second. 

As discussed above, 2x2 reductions require a 

45 first logical operation between rows followed by a 
second, possibly different, logical operation be- 
tween columns. Moreover, some threshold levels 
require two intermediate reduced images which are 
then combined. The table lookup technique for 

so column operations can become cumbersome if it is 
desired to have a very wide pixelword. Either the 
table becomes enormous or one needs special 
techniques of looking up parts of the wide pixel- 
word in multiple parallel tables. The latter, while 

55 clearly superior, does require some way to use 
portions of the data word as memory addresses, 
which may not otherwise be necessary. 

Fig. 4 is a logic schematic of specialized hard- 
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ware for performing a logical operation between 
vertically adjacent 2Q-bit pbcelwords and a pairwise 
bit reduction of the resulting 2Q-bit pixelword (bits 
0 through 2CM). Although the drawing shows a 16- 
pixel word, the benefits of this hardware would 
become manifest for much longer pixelwords 
where the lookup table technique has become 
cumbersome. A 512-bit pixelword is contemplated, 
since a line of image would represent only a few 
pixelwords. 

The reduction of the two pixelwords occurs in 
two stages, designated 200 and 202. In the first 
stage, a vertically adjacent pair of pixelwords is 
read from a first memory 203, and the desired first 
logical operation is carried out between them. The 
desired second logical operation is then carried out 
between the resulting pixelword and a version of 
the pixelword that is shifted by one bit. This pro- 
vides a processed pixelword having the bits of 
interest (valid bits) in every other bit position. In the 
second stage, the valid bits in the processed pixel- 
word are extracted and compressed, and the result 
stored in a second memory 204. Memory 203 is 
preferably organized with a word size correspond- 
ing to the pixelword size. Memory 204 may be 
organized the same way. 

The preferred implementation for stage 200 is 
an array of bit-slice processors, such as the IDT 
49C402 processor, available from Integrated De- 
vice Technology. This specific processor is a 16-bit 
wide device, each containing 64 shiftable registers. 
Thirty-two such devices would be suitable for a 
512-bit pixelword. For simplification, only four reg- 
isters 205,206.207, and 208 of one 16-bit device 
210 are shown. Among the processor's operations 
are those that logically combine the contents of 
first and second registers, and store the result in 
the first The processor has a data port 215, which 
is coupled to a data bus 217. 

Second stage 202 includes first and second 
latched transceivers 220 and 222, each half as 
wide as the pixelword. Each transceiver has two 
ports, designated 220a and 220b for transceiver 
220 and 222a and 222b for transceiver 222. Each 
transceiver is half as wide as the pixelword. Ports 
220a and 222a are each coupled to the odd bits of 
data bus 217, which correspond to the bits of 
interest Port 220b is coupled to bits 0 through (G- 
1) of the data bus. while port 222b is coupled to 
bits Q through (2Q-1). The bus lines are pulled up 
by resistors 125 so that undriven lines are pulled 
high. 

Consider the case of a 2x2 reduction with 
LEVEL = 2. The sequence of operations requires 
that (a) a vertically adjacent pair of pixelwords be 
ANDed to form a single 2Q-bit pixelword, adjacent 
pairs of bits be ORed to form a Q-brt pixelword, 
and the result be stored; (b) the vertically adjacent 



pair of pixelwords be ORed, adjacent bits of the 
resultant 2Q-bit pixelword be ANDed, and the re- 
sultant Q-bit pixelword be stored; and (c) the two 
Q-bit pixelwords be ORed. 

5 To effect this, a pair of vertically adjacent pixel- 

words are read from first memory 203 onto data 
bus 217 and into registers 205 and 206. Registers 
205 and 206 are ANDed and the result stored in 
registers 207 and 208. The content of register 208 

70 is shifted one bit to the right registers 207 and 208 
are ORed, and the result is stored in register 208. 
Registers 205 and 206 are ORed, and. the result 
stored in registers 206 and 207. The content of 
register 207 is right shifted by one bit. registers 

is 206 and 207 are ANDed. and the result stored in 
register 207. 

At this point, register 207 contains the result of 
ORing the two pixelwords and ANDing pairs of 
adjacent bits, while register 208 contains the result 

20 of ANDing the pixelwords and ORing pairs of adja- 
cent bits. However, registers 207 and 208 contain 
the valid bits in the odd bit positions 1, 3,...(2Q-1). 
For a reduction with LEVEL = 2, registers 207 and 
208 are ORed and the result is made available at 

25 processor data port 215 which is coupled to data 
bus 217. 

The odd bits of the data bus are latched into 
transceiver 220 through port 220a, resulting in a Q- 
bit pixelword with the valid bits in adjacent posi- 

30 tions. Although this Q-bit entity could be read back 
onto the bus and transferred to memory 204, it is 
preferable to use both latches. Thus, two new pix- 
elwords (horizontally adjacent to the first two) are 
processed at stage 200 as described above, the 

35 result is made available at processor data port 21 5, 
and is latched into transceiver 222 through port 
222a. The contents of the two transceivers are then 
read out through ports 220b and 222b onto data 
bus 217 in order to provide a 20-bit pixelword that 

40 represents the reduction of four 20-bit pixelwords. 
The result is transferred to second memory 204. 
This overall sequence continues until all the pixel- 
words in the pair of rows has been processed. 
Once the pair of rows has been processed, subse- 

45 quent pairs are similarly processed. 

As mentioned above each bit-slice processor 
has 64 registers. Accordingly, since memory ac- 
cesses are more efficient in a block mode, faster 
operation is likely to result if 8 pairs of pixelwords 

so are read from memory 203 in a block, processed 
as discussed above, stored in the processor's reg- 
isters, and written to memory 204 in a block. 

Image enlargement is similar, but the steps are 
executed in the reverse order. First, the processor 

55 reads a pixelword and sends the left half through 
port 220b of transceiver 220. This is read onto the 
bus through port 220a Only every other pixel in 
the resulting word on the bus will initially be valid, 
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so the processor will need to validate all the pixels 
using a sequence of shifts and logic operations. 
Since resistors 225 pull up all the bus lines that are 
not driven, each undriven line, all the even bits in 
this case, will be 1's. This expanded pixelword. 
which alternates 1's with valid data, is read into two 
registers, the content of one register is shifted one 
place, and the registers are logically ANDed. Ev- 
erywhere there was a 0 in an odd bit, there will be 
00 in an even/odd pair. None of the other bits will 
be affected. This pixelword is then written to two 
vertically adjacent words in the expanded image. 
This process is repeated for the right half of the 
pixelword using the transceiver 222. The processor 
expands the entire row one pixelword at a time and 
the entire image one row at a time. 

III. Graphical Illustrations 

Rgs. 2 and 3 illustrate the method for deter- 
mining skew, in graphical form. A 2x reduction is 
used to compute the skew angle for the text image 
shown in Fig. 2A. The variance for the image 
shown in Fig. 2A is shown in Rg. 20. The cal- 
culated skew is found to be -0.60 degrees. In Rg. 3 
a 4x reduction (shown in Rg. 3B) is used to com- 
pute the skew angle for the image. The variance 
curve for the image shown in Rg. 3A is shown in 
Rg. 3B. The calculated variance is a maximum at 
-0.66 degrees. The actual skew angle is about 
-0.60 degrees, showing typical accuracy of the 
method. 

A method/apparatus in accordance with the 
present invention as described above provides a 
fast and accurate means of determining document 
skew. It is to be understood that the above descrip- 
tion is intended to be illustrative and not restrictive. 
For example, while the method is illustrated above 
primarily with reference to maximizing the variance 
in the number of ON pixels in a scanline, it could 
readily be applied by optimizing the variance in the 
number of OFF pixels in a scanline as a function of 
rotation. 

Methods/apparatus in accordance with the in- 
vention will have a variety of applications including 
use within a document recognition system. The 
accuracy of most optical character recognition 
(OCR) systems, for example, degrades badly when 
text is skewed by more than about 5%. Using a 
method as described herein, an OCR system may 
quickly check for skew and adjust the skew of the 
document accordingly. The OCR system may then 
identify characters in the deskewed document im- 
age. 

Alternatively, methods/apparatus in accordance 
with the present invention could be used within an 
electro-reprographic (ER) copier. The human eye 
can detect even a small amount of document skew 



but, through use of a method/apparatus in accor- 
dance with the invention, ER copiers may check 
and compensate for document skew without me- 
chanical complexity. The deskewed document im- 

5 age may then be reproduced. That will be particu- 
larly useful in cases when an original that is being 
copied is skewed. Other possible applications of 
methods/apparatus in accordance with the inven- 
tion include, but are not limited to, skew correction 

ro for image coding, and the like. 

Claims 

75 1. A method of determining skew of an image in 
a digital processing means comprising the 
steps of determining (4.6,8) a variance either in 
a number of ON pixels or in a number of OFF 
pixels in at least one scanline of said image as 

20 a function of document rotation angle, and 
locating (10) the document rotation angle at 
which said variance is substantially a maxi- 
mum. 

25 2. A method as claimed in claim 1, preceded by 
the step (2) of reducing said image. 

3. A method as claimed in claim 1 or claim 2, 
wherein said variance is calculated by an 

30 equation of the form: 

NE (n,)p 
1 = 1 

35 where: 

N is a number of lines; 

p is a real number greater than 1 ; 

n is the number of the said pixels in a line; and 

i is a counter for the lines. 

40 

4. A method as claimed in claim 3, wherein p is 
about 2. 

5- A method as claimed in any one of the preced- 
es ing claims, wherein substantially all of the 
scanlines in a document are included the de- 
termination of said variance. 

6. A method as claimed in any one of the preced- 
so ing claims, wherein document rotation angles 

are simulated by a vertical shear of said im- 
age. 

7. A method as claimed in any one of the preced- 
55 ing claims, and further comprising the step 

(12) of rotating said image by the said located 
angle so as to produce a deskewed document 
image. 
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a A digital image processing device programmed 
to carry out a method as claimed in any one of 
the preceding claims. 

9. An optical character recognition system com- s 
prising: 

a) input means for inputting an image of a 
document; 

b) deskewing means comprising: 

i) means for simulating rotation of said to 
image from at least a first skew angle to 
a second skew angle; 
it) means for calculating variance of a 
number of ON pixels in a scanline of said 
image at each of said skew angles; and ts 
iii) means for rotating said image to 
about an angle which substantially maxi- 
mizes said variance to produce a de- 
skewed image; 

c) means for identifying characters in said 20 
deskewed image; and 

d) means for outputting said characters in 
said image. 

10. Apparatus for copying a document comprising: 25 

a) input means for inputting an image of a 
document; 

b) deskewing means comprising: 

i) means for simulating rotation of said 
image from at least a first skew angle to 30 
a second skew angle; 

ii) means for calculating variance of a 
number of ON pixels in a scanline of said 
image at each of said skew angles; and 

iii) means for rotating said image to 35 
about an angle which substantially maxi- 
mizes said variance to produce a de- 
skewed image of said document; 

c) means for outputting said deskewed im- 
age of said document 40 
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